Assessing the effect of physical differences in the articulation of consonants and vowels on audiovisual temporal perception

نویسندگان

  • Argiro Vatakis
  • Petros Maragos
  • Isidoros Rodomagoulakis
  • Charles Spence
چکیده

We investigated how the physical differences associated with the articulation of speech affect the temporal aspects of audiovisual speech perception. Video clips of consonants and vowels uttered by three different speakers were presented. The video clips were analyzed using an auditory-visual signal saliency model in order to compare signal saliency and behavioral data. Participants made temporal order judgments (TOJs) regarding which speech-stream (auditory or visual) had been presented first. The sensitivity of participants' TOJs and the point of subjective simultaneity (PSS) were analyzed as a function of the place, manner of articulation, and voicing for consonants, and the height/backness of the tongue and lip-roundedness for vowels. We expected that in the case of the place of articulation and roundedness, where the visual-speech signal is more salient, temporal perception of speech would be modulated by the visual-speech signal. No such effect was expected for the manner of articulation or height. The results demonstrate that for place and manner of articulation, participants' temporal percept was affected (although not always significantly) by highly-salient speech-signals with the visual-signals requiring smaller visual-leads at the PSS. This was not the case when height was evaluated. These findings suggest that in the case of audiovisual speech perception, a highly salient visual-speech signal may lead to higher probabilities regarding the identity of the auditory-signal that modulate the temporal window of multisensory integration of the speech-stimulus.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cross-Modal Fusion: Context Effects in Lexical Words

The study focuses on the response of participants to audiovisual presentations of talking heads, and examines the effect of noise and temporal misalignment of channels in English monosyllabic words. The results show that McGurk fusion of phonetic segments is sensitive to the linguistic context of a segment: coda consonants elicit fusion more frequently than onset consonants and short vowels eli...

متن کامل

Visual Cues Contribute Differentially to Audiovisual Perception of Consonants and Vowels in Improving Recognition and Reducing Cognitive Demands in Listeners With Hearing Impairment Using Hearing Aids.

Purpose We sought to examine the contribution of visual cues in audiovisual identification of consonants and vowels-in terms of isolation points (the shortest time required for correct identification of a speech stimulus), accuracy, and cognitive demands-in listeners with hearing impairment using hearing aids. Method The study comprised 199 participants with hearing impairment (mean age = 61....

متن کامل

Perception of Place-of-Articulation Distinctions: Common Representation for Vowels and Consonants

We look for a common perceptual representation of place-of-articulation distinctions between stop consonants and those between closed vowels. This representation can be either acoustic or articulatory in nature but not both acousticarticulatory because the acoustic consequences of front to back articulatory changes are inverted for stops and vowels. Identification data show that the perceptual ...

متن کامل

Cross-modal integration during vowel identification in audiovisual speech: a functional magnetic resonance imaging study.

To investigate the neural substrates of the perception of audiovisual speech, we conducted a functional magnetic resonance imaging study with 28 normal volunteers. We hypothesized that the constraint provided by visually-presented articulatory speech (mouth movements) would lessen the workload for speech identification if the two were concordant, but would increase the workload if the two were ...

متن کامل

Audiovisual perception of congruent and incongruent Dutch front vowels.

PURPOSE Auditory perception of vowels in background noise is enhanced when combined with visually perceived speech features. The objective of this study was to investigate whether the influence of visual cues on vowel perception extends to incongruent vowels, in a manner similar to the McGurk effect observed with consonants. METHOD Identification of Dutch front vowels /i, y, e, Y/ that share ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2012